Automatic Image Annotation and Retrieval Using the Latent Dirichlet Allocation Model

نویسندگان

  • Tong Zhang
  • Zhe-Ming Lu
  • Kap Luk Chan
  • Zhen Li
چکیده

Content-based image retrieval faces a vital problem, namely “semantic gap” that exists between low level features and semantic concept. In order to solve this problem, image automatic annotations that allow users to access a large image database with textual queries are put forward. In this paper, the main study concentrates on an automatic image annotation method based on vector quantization (VQ) algorithms and Latent Dirichlet Allocation (LDA) model. VQ is used as a clustering and condensing technique. LDA model is introduced from text based information retrieval, which is widely studied nowadays and proved to be effective on discrete data processing and demission reduction. The experiment is carried out on the platform of MATLAB and applied to the Corel database of 400 images. The presented experiment results demonstrate the effectiveness of the proposed method in its application of automatic image annotation and keyword based image retrieval.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Fuzzy Neighbor Voting for Automatic Image Annotation

With quick development of digital images and the availability of imaging tools, massive amounts of images are created. Therefore, efficient management and suitable retrieval, especially by computers, is one of themost challenging fields in image processing. Automatic image annotation (AIA) or refers to attaching words, keywords or comments to an image or to a selected part of it. In this paper,...

متن کامل

Automatic Music Annotation

In the last ten years, computer-based systems have been developed to automatically classify music according to a high-level musical concept such as genre or instrumentation. These automatic music annotation systems are useful for the storage and retrieval of music from a large database of musical content. In general, a system begins by extracting features for each song. The labels and features ...

متن کامل

Automatic keyword extraction using Latent Dirichlet Allocation topic modeling: Similarity with golden standard and users' evaluation

Purpose: This study investigates the automatic keyword extraction from the table of contents of Persian e-books in the field of science using LDA topic modeling, evaluating their similarity with golden standard, and users' viewpoints of the model keywords. Methodology: This is a mixed text-mining research in which LDA topic modeling is used to extract keywords from the table of contents of sci...

متن کامل

Web-Scale Image Annotation

In this paper, we describe our experiments using Latent Dirichlet Allocation (LDA) to model images containing both perceptual features and words. To build a large-scale image tagging system, we distribute the computation of LDA parameters using MapReduce. Empirical study shows that our scalable LDA supports image annotation both effectively and efficiently.

متن کامل

Learning Semantics for Image Annotation

Image search and retrieval engines rely heavily on textual annotation in order to match word queries to a set of candidate images. A system that can automatically annotate images with meaningful text can be highly beneficial for such engines. Currently, the approaches to develop such systems try to establish relationships between keywords and visual features of images. In this paper, We make th...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2012